Modeling the influence of lime on the unconfined compressive strength of reconstituted graded soil using advanced machine learning approaches for subgrade and liner applications

In the field of soil mechanics, especially in transportation and environmental geotechnics, the use of machine learning (ML) techniques has emerged as a powerful tool for predicting and understanding the compressive strength behavior of soils especially graded ones. This is to overcome the sophisticated equipment, laboratory space and cost needs utilized in multiple experiments on the treatment of soils for environmental geotechnics systems. This present study explores the application of machine learning (ML) techniques, namely Genetic Programming (GP), Artificial Neural Networks (ANN), Evolutionary Polynomial Regression (EPR), and the Response Surface Methodology in predicting the unconfined compressive strength (UCS) of soil-lime mixtures. This was for purposes of subgrade and landfill liner design and construction. By utilizing input variables such as Gravel, Sand, Silt, Clay, and Lime contents (G, S, M, C, L), the models forecasted the strength values after 7 and 28 days of curing. The accuracy of the developed models was compared, revealing that both ANN and EPR achieved a similar level of accuracy for UCS after 7 days, while the GP model performed slightly lower. The complexity of the formula required for predicting UCS after 28 days resulted in decreased accuracy. The ANN and EPR models achieved accuracies of 85% and 82%, with R2 of 0.947 and 0.923, and average error of 0.15 and 0.18, respectively, while the GP model exhibited a lower accuracy of 66.0%. Conversely, the RSM produced models for the UCS with predicted R2 of more than 98% and 99%, for the 7- and 28- day curing regimes, respectively. The RSM also produced adequate precision in modelling UCS of more than 14% against the standard 7%. All input factors were found to have almost equal importance, except for the lime content (L), which had an average influence. This shows the importance of soil gradation in the design and construction of subgrade and landfill liners. This research further demonstrates the potential of ML techniques for predicting the strength of lime reconstituted G-S-M-C graded soils and provides valuable insights for engineering applications in exact and sustainable subgrade and liner designs, construction and performance monitoring and rehabilitation of the constructed civil engineering infrastructure.


Introduction
Soil stabilization is the process of changing or maintaining one or more soil qualities to improve a soil's engineering features and performance [1].This process is executed during the construction of civil engineering infrastructures like pavement foundation (e.g., subgrade) and landfill liners (compacted earth liners).The technique of adding a specific soil, chemical components, or other cementing material to a natural soil to improve one or more of its attributes is known as soil stabilization.Stabilization can be achieved by mechanically mixing natural soil and stabilizing material together to form a homogeneous mixture, or by applying the stabilizing substance to an undisturbed soil deposit and allowing it to infiltrate through soil voids to achieve interaction [2].Stabilization is a 5000-year-old notion [3] stated that ancient Egypt and Mesopotamia used stabilized earth roads and that the Greeks and Romans used lime as a stabilizer.The addition of cement or lime to soil can help to stabilize it.Such stabilization procedures improve the stabilized soil's varied engineering qualities, resulting in better construction material.The benefits of soil stabilization include increased soil strength, durability stiffness, changes in permeability, density, porosity, and volume, waterproofing, reduced surface abrasion, and reduced soil plasticity and swelling/shrinkage potential [4,5].Lime is frequently used as a soil stabilizing agent because it is readily accessible, inexpensive, and effective at enhancing the soil's strength and durability.The unconfined compressive strength, abbreviated as UCS, is frequently used to determine the efficacy of different soil stabilization techniques [6].Apart from other qualities of stabilized soil, scientists generally agree that the most important outcome of stabilization is unconfined compressive strength (UCS) [7].The stabilized soil's unconfined compressive strength is determined by the water and the cement amount in the mixture, the properties and types of the soil, and the curing and mixing conditions [8][9][10][11][12].The UCS determination of stabilized soil is critical in the construction and improvement design.Some research has been done to forecast the UCS of stabilized soil using input parameters including the binder concentration, water/binder ratio, and curing duration [13][14][15][16][17].
Many researchers have used soft computation and other techniques to develop models for estimating UCS values of stabilized soils [1,15,[18][19][20].In recent decades, computational intelligence methods that are accomplished of estimating the input-output non-linear relationships for many complicated issues have piqued interest [21].As previously stated, the UCS is influenced by several factors.Determining the UCS of soil samples requires time-consuming and labor-intensive lab work.Developing predictive models to deal with this problem could be advantageous.Many methods, such as traditional linear regressions, can be used to develop such behavioral models [22,23].However, regression techniques have several important problems, such as expecting a pre-specified nonlinear or linear relationship between the outputs and inputs, which is not necessarily the case [24].Machine learning (ML) techniques have been more well-liked in recent years as a result of their capacity to assess and anticipate intricate relationships between output and input variables.This ability has helped the techniques become more widely used.Three of the most popular machine learning methods used for this are Gaussian processes (GP), evolutionary programming (EPR), and artificial neural networks (ANN).GP is a method based on probability that employs a kernel function to model the relationship between input variables and output variables.It has been effectively implemented in a variety of applications, containing soil mechanics, to model the behavior of soil and predict its engineering properties.EPR is a method for population-based optimization that imitates the process of natural evolution in order to discover the optimum solution to a problem.In the discipline of soil mechanics, it has been used to enhance the design of foundations and to simulate the behavior of soil under a variety of loading conditions [25].
An Artificial Neural Network (ANN) is a computational model made up of various procedures modules in which neurons generate output values depending on the input values.ANN is getting popular in a variety of geotechnical engineering applications [26,27].Some researches, such as reported, [14,15,[28][29][30][31] have used ANN models to estimate the UCS value in chemical soil stabilization.The behavior of soil and its engineering properties can be predicted using ANN, which has found applications in a variety of fields, including soil mechanics.Analytical formulas depend on ANN models are more precise than multivariable nonlinear regression or high-performance multiple regression analysis.ANN, on the other hand, is regarded as a "black box" application.Complex transfer functions, such as logistic sigmoid and hyperbolic tangent sigmoid functions, are used to create prediction equations.As an outcome, the use of ANN-based prediction purposes is limited because they cannot easily calculate the output using the input values [32].As a result, in this study, the ANN approach is used to estimate a UCS prediction model.Despite the ANN models' reasonable performance, they need practical equations for calculating the results.Pharm et al. [27] used the ANN approach to develop the UCS prediction model concerning the specified parameters.A statistical analysis reveals that the suggested model created in this work is dependable and accurate, with a high relationship coefficient and minimal root mean squared errors.The most important variables impacting the UCS value, according to the ANN-based model, are and cement content the soil particles percentage passing filter 0.5 mm.Jahed Armaghoni et al. [33] developed three models to forecast UCS of granite: multiple regression analysis (MRA), adaptive neuro-fuzzy inference system (ANFIS), and artificial neural network (ANN).ANFIS is more precise than the other two models, they discovered.According to the test data, it is also proven that ANN outperforms MRA.Sharma et al [34] established four numerous and simple linear regression models to forecast UCS of structured artificial soil.All of their models have an R 2 of greater than 0.9, indicating that they are accurate.More sophisticated models with more input variables, on the other hand, showed greater accuracy.
Genetic programming (GP) is a type of managed machine learning approach based on Darwin's theory of evolution [35].It's a different way of looking at behavior modeling.Geneexpression programming (GEP) is a division of GP that uses a computer program to generate a solution to a problem [36], and it is the method most typically employed in geotechnical engineering [37].GEP also selects populations, depending on fitness purpose and presents them with a gene via numerous operators [38].Without making any expectations about the likely functional connections structure, GEP can construct strong prediction functions [39].The GEP model is a reliable, strong, and precise forecasting method.Furthermore, GEP-based equations are more practical and transparent than ANN-based formulas.As a result, the predictive proposed equations derived from the GEP model may be ready for use.
Cement, asphalt, and lime were combined with weak soil in this study to rise the strength of the soil specimens.The impacts of additive contents (i.e.cement, asphalt, and lime) on the UCS value were investigated using UCS tests on stabilized specimens.The effective majors are also used to estimate the UCS value using MLR and NLR methods.The prediction formulas among variables and the relative relevance of input parameters were measured using ANNs and GAs analysis.The UCS data obtained from soaked and unsoaked mixtures were combined in this study.This means that the UCS values obtained from the soil-lime mixture after it had been soaked were combined with the UCS values obtained from the mixture before it had been soaked.This method is frequently utilized in soil mechanics research in order to provide a more in-depth investigation of the efficiency of various soil stabilization strategies.In order to assess the stocked UCS's contribution to the improvement of the lime reconstituted mixture, the ML techniques of GP, EPR, and ANN were utilized.The behavior of soil is highly complex and nonlinear, and traditional analytical and empirical approaches may not be able to capture all of the underlying relationships between output and input variables.Because of this, the usage of ML techniques is particularly useful in soil mechanics [40].
GP, EPR, and ANN can all model complex relationships and make predictions based on the input data.GP is a non-parametric method that suppose no functional particular form for the correlation between output and input variables.Therefore, it is capable of capturing nonlinear relationships and managing chaotic data.EPR is a population-based optimization method that can seek a vast search space for the optimal solution.This capability makes EPR a useful modeling tool for complex systems [41].Due to its adaptability, ANN has been widely employed to model intricate relationships in numerous fields, including soil mechanics.The stored UCS data for the soil-lime me ´lange served as input data for machine learning techniques, and ML models were trained using the input data to predict the UCS values.The ML methodologies utilized the accumulated UCS data for the soil-lime mixture as input data.After instructing the models, they were used to determine if the addition of calcium to the soil increased UCS values [42].
The use of both unsoaked and soaked UCS data in the study allows for a more comprehensive evaluation of the soil-lime mixture's efficacy, as the soaked UCS values are more indicative of the mixture's performance in moist conditions.This is due to the fact that the unsoaked UCS values reflect the mixture's performance in dry conditions [43].
ML methods can also be employed to optimize soil stabilization systems by identifying the optimal combination of stabilizing compounds, soil varieties, and other variables.This can be accomplished by identifying the optimal combination of factors.Engineers can use this information to design systems that are more effective, efficient, and cost-effective, while still meeting specific performance requirements [44].However, there are some limitations to the application of ML methods in soil mechanics.To train machine learning models, high-quality data is required, which can be considered a drawback of the technology.In the field of soil mechanics, where data acquisition can be time-consuming and costly, this can be challenging.Moreover, ML models can be challenging to interpret, making it difficult to comprehend the underlying mechanisms that regulate soil behavior [45].This can make understanding the mechanisms underlying soil behavior more difficult.Overall, ML techniques provide a robust and adaptable method for evaluating the effectiveness of soil stabilization techniques, and they can be combined with other techniques to provide a more complete understanding of soil behavior.Additionally, these methodologies can be used to evaluate the effectiveness of soil stabilization techniques [46].
The application of ML techniques in this study provides an essential tool for evaluating the effectiveness of soil stabilization techniques and can aid engineers in designing more efficient and effective soil stabilization systems.In addition to the previously mentioned ML techniques, additional methods exist for evaluating the effectiveness of soil stabilization techniques [47].Techniques that fall into this category contain empirical, analytical, and numerical approaches [48].Empirical methods are frequently used in the field of soil mechanics to develop correlations between the engineering properties of soil and other factors such as soil type, moisture content, and confining pressure [49].Statistical analysis of experimental data forms the basis for these empirical methodologies [12].Despite providing estimates of soil behavior in a fast and straightforward manner, it is possible that these techniques are limited in their ability to capture complex relationships [41,50].
Analytical methods are founded on mathematical models that characterize the behavior of soil using equations derived from fundamental principles of soil mechanics [51].These mathematical models provide the foundation for analytical methods [52].These methodologies can enhance the design of soil stabilization systems by providing a deeper understanding of soil behavior.However, in order to simplify them, it may be necessary to make assumptions that reduce their precision [53].The foundation of numerical methods is the use of numerical models, such as finite element analysis (FEA) and discrete element method (DEM), in computer simulations of soil behavior [54].These methods are supported by computer simulations [48].Nevertheless, these methods can be computationally intensive and require a substantial amount of knowledge to be utilized effectively [55].In contrast, they can provide precise and exhaustive predictions of soil behavior.When making a decision regarding soil stabilization techniques, it is essential to consider the specific application, the available data, the desired level of precision and complexity, and the evaluation method [56].The implementation of ML techniques offers a method that is both robust and adaptable, and it can be combined with other techniques to provide a more comprehensive analysis of the behavior of soil.
In the discipline of soil mechanics, techniques based on machine learning have several advantages over more conventional methods.In the field of soil mechanics, where data can be chaotic and highly variable, their ability to manage large and complex datasets is especially advantageous [57][58][59].Traditional methods may not be able to identify nonlinear relationships between input and output variables, whereas ML techniques can [58].The ability of machine learning techniques to learn from data and adapt to changing conditions is an additional advantage of using these methods [57].This is particularly useful in the field of soil mechanics, which is significant because the behavior of soil is highly dependent on a number of variables, including its moisture content, confining pressure, and loading rates.Machine learning techniques can learn from data collected under diverse conditions and predict future conditions [47].
In this research paper, the strengths at 7-and 28-days curing period have been forecasted by using four intelligent techniques for a reconstituted soil treated with lime at different dosages.This is for the purpose of subgrade (as a transportation geotechnic structure) and landfill liner (as an environmental geotechnic infrastructure) design and construction.These structures have special benefits they offer to the overall life of humanity according to the UNSDGs.Transportation geotechnics and environmental geotechnics are two specialized branches of geotechnical engineering that focus on the application of soil mechanics, rock mechanics, and other geotechnical principles to the design, construction, and maintenance of transportation infrastructure and environmental projects [15].Transportation geotechnics involves the application of geotechnical engineering principles to the design and construction of transportation infrastructure such as highways, railways, airports, and ports.Subgrade and pavement design: This involves evaluating the properties of the underlying soil or rock to ensure that it can support the loads imposed by traffic and environmental conditions, and designing suitable pavement structures.Slope stability and embankment design: Assessing the stability of natural and man-made slopes, as well as designing and constructing embankments that can support transportation infrastructure [17].Foundation design for transportation structures: Ensuring that transportation structures such as bridges, tunnels, and retaining walls have stable foundations in a variety of soil and rock conditions.Ground improvement techniques: Implementing methods to strengthen or stabilize the ground, such as soil compaction, reinforcement, grouting, or soil replacement, to improve the performance of transportation infrastructure [35].On the other hand, environmental geotechnics focuses on the application of geotechnical engineering principles to address environmental challenges, such as waste management, contaminated site remediation, and sustainable infrastructure development [23].Landfill design and management: Evaluating the geotechnical properties of the site for landfill construction, designing liners and leachate collection systems to prevent environmental contamination, and managing the stability and settlement of waste fills.Contaminated site remediation: Applying geotechnical techniques to remediate sites contaminated with pollutants, such as heavy metals, hydrocarbons, or hazardous chemicals, by using methods like soil stabilization, containment, and in-situ treatment.Geotechnical aspects of sustainable infrastructure: Incorporating geotechnical considerations into the design and construction of sustainable infrastructure, such as green roofs, permeable pavements, and engineered natural systems for stormwater management [42].Both transportation geotechnics and environmental geotechnics require a thorough understanding of soil behavior, groundwater conditions, and the interaction between soil and man-made structures [26].Many more references have been made that studied the use lime in single and or combination of other cementitious materials such as the application of cement/ lime combination with pozzolans accompanied with crushed stone waste to improve the strength of soil [60], the application of lime-nano-silica combination [61], the treatment of silty soil with lime considering the strength improvement comparison between UCS and splitting tensile strength [62], and the application of bagasse-lime combination, where ANN was also utilized to model the behavior of this material combination in soil [63].Yet, none tried the use of the novel response surface methodology (RSM) in the combination of other regression techniques to model these problems.Also, the application of lime has continued to come in the combination of other materials.But, in the present work, lime has been applied as the only cementitious material and a combination of four machine learning techniques have been used to predict the strength development of the treated soil considering its gradation pattern.Geotechnical engineers working in these fields play a crucial role in ensuring the safe, cost-effective, and sustainable development of transportation and environmental infrastructure.For the need to solve most environmental geotechnics problems, the more pronounced of which are those related to landfills and pavement foundations across the world, this work has been undertaken to propose more reliable and robust mathematical models based on advanced machine learning techniques.These techniques are flexible because of the advantage of utilizing proposed closed-form equations to apply the models manually as well as smartly.Landfills and pavement subgrade systems require immediate attention from design to construction and to usage over the period to monitor the optimized utilization of the sustainable materials used in their construction.So, this research project presents a potential for use in this area.Many other research works are stated above have been carried but none presents a combination of the four techniques applied in this paper.Meanwhile, the flowchart of the present research project is illustrated in Fig 1.

Data collection and statistical analysis
The complete database constitutes of an open database of 136 records, which were collected from experimentally tested samples of soil stabilized with lime deposited at the US soil stabilization database, which can be found in the cited literature [64].These soils were collected across the world especially the United States in the East central Iowa, North central Florida, Illinois, Kansas, etc. and classified as dune sand, kaolinite clay, illinite clay, montmorillonite clay, alluvial, sand loess, friable loess, plastic loess, leached Kansas till, unleached Kansas till, etc. [64].The lime reconstituted graded soil samples were collected at different locations within US soil and other points within the upper greater Europe giving a global representation of soils.Each record contains the following data: • Unconfined compressive strength after 7 days (kPa), UCS7 • Unconfined compressive strength after 28 days (kPa), UCS28 The collected records were partitioned into a training set of � 75% (100 records) and validation set of � 25% (36 records) observing the conditions of the k-fold cross-validation to overcome under-fitting issues in the model.Tables 1 and 2 include the complete dataset for the training and validation respectively, while Tables 3 and 4 summarize their statistical

Research program
Three different Artificial Intelligent (AI) techniques and one symbolic machine learning trained technique were used to predict the unconfined compressive strengths after 7 and 28 days (UCS7, UCS28) of open-air curing of the lime reconstituted soil using the collected, sorted and organized database.These AI techniques are the Genetic Programming (GP), three models trained differently of Artificial Neural Network (ANN) and polynomial regression optimized using genetic algorithm which is known as the Evolutionary Polynomial Regression (EPR) [49][50][51].The symbolic machine learning trained technique is known as the response surface methodology due to its simple interface of proposing a closed-form equation that allows researchers to apply its model automatically and manually [52][53][54][55][56][57][58][59].All the four (4) developed models were deployed to predict (UCS7, UCS28) in (kPa) using the soil grading proportions from Gravel, Sand, Silt, and Clay and additive; Lime contents (G, S, M, C & L).Each of the four developed models was based on a different approach: an evolutionary approach for GP, mimicking biological neurons for ANN, an optimized mathematical regression technique for EPR and symbolic regression interface algorithm for the RSM.However, all the models were assessed for prediction accuracy based on the Sum of Squared Errors (SSE) and the determination coefficient known as the R-squared value (R 2 ).In the data partitioning, the k-fold cross-validation has been applied to solve problems of under-fitting due to the size of the data points [58,59].The accuracies of the developed models were evaluated by comparing the SSE between predicted and calculated shear strength parameter values based on the results of each model.

Genetic Programming (GP)
Genetic programming (GP), the framework of which is illustrated in Fig 3 is a type of evolutionary algorithm that is used to automatically generate computer programs to solve problems or perform tasks [52].It is a machine learning technique that draws inspiration from the process of natural selection and genetic evolution [53].The basic idea behind genetic programming is to create a population of candidate computer programs, represented as trees or graphs, and then use evolutionary principles such as selection, crossover, and mutation to evolve and improve these programs over successive generations until a desired level of performance is achieved.Representation: In genetic programming, computer programs are typically represented as trees or graphs, with nodes representing operations or functions, and edges representing the flow of data or control between operations.Initialization: The process starts by creating an initial population of random programs [54].Each program is evaluated based on its ability to solve the given problem or perform the desired task.Selection: Programs from the current population are selected for reproduction based on their fitness, which is a measure of how well they perform the task.This is typically done using a fitness function that quantifies the performance of each program.Crossover: Selected programs are combined through crossover, a process that mimics genetic recombination in nature [55].During crossover, sub-trees or sub-graphs from two parent programs are exchanged to create new offspring programs.Mutation: Random changes are introduced to the offspring programs through mutation, simulating genetic variation.This helps in exploring new regions of the search space and preventing premature convergence to suboptimal solutions.Evaluation: The newly created programs are evaluated using the fitness function to determine their performance on the given task.Termination: The evolutionary process continues for a certain number of generations or until a termination condition is met, such as reaching a satisfactory level of performance or running out of computational resources [54].Genetic programming has been successfully applied to a wide range of problems, including symbolic regression, automatic program synthesis, control system design, and pattern recognition [52][53][54][55][56].It is a powerful approach for automatically discovering solutions to complex problems without the need for human-designed algorithms.

Artificial Neural Network (ANN)
An artificial neural network (ANN), the typical framework if which is illustrated in Fig 4 is a computational model inspired by the structure and function of biological neural networks, such as the human brain [49].It is a powerful machine learning technique used for solving complex problems such as pattern recognition, classification, regression, and more [50].Structure: ANN consists of interconnected nodes, called neurons or units, organized in layers.Typically, there are three types of layers: input layer, hidden layers, and output layer [51].The connections between neurons are associated with weights that are adjusted during the training process.Learning: ANNs learn from data through a process called training.During training, the network is presented with input data, and the weights of the connections are adjusted based on the network's output and the expected output [50].This process is often performed using optimization algorithms such as gradient descent and its variants.Activation Function: Each neuron in a neural network typically applies an activation function to the weighted sum of its inputs.Common activation functions include sigmoid, tanh, ReLU (Rectified Linear Unit), and their variants [49].Types of Neural Networks: There are various architectures of neural networks, such as feedforward neural networks (the most basic type), convolutional neural networks (CNNs) for image processing, recurrent neural networks (RNNs) for sequential data, and more advanced architectures like deep neural networks (DNNs) and generative adversarial networks (GANs) [50].Applications: ANNs are used in a wide range of applications, including image and speech recognition, natural language processing, recommendation systems, financial forecasting, medical diagnosis, and many other fields where complex pattern recognition and prediction tasks are required [51].Challenges: Training a neural network can be computationally intensive and requires a large amount of labeled data.Overfitting, where a model performs well on training data but poorly on unseen data, is a common challenge that needs to be addressed through techniques like regularization and cross-validation.Overall, artificial neural networks have proven to be highly effective in solving complex problems, and their capabilities continue to expand with ongoing research and advancements in the field of deep learning.

Evolutionary Polynomial Regression (EPR)
Evolutionary Polynomial Regression (EPR) is a non-linear regression technique that uses a genetic programming approach to evolve mathematical models [49].It was proposed by Dr. Nordin Zakaria in the early 1990s.EPR combines the concepts of genetic algorithms and polynomial regression to automatically evolve a mathematical model that best fits a given dataset.The algorithm starts with a population of random mathematical expressions (polynomials) and uses genetic operators such as selection, crossover, and mutation to evolve these expressions over several generations [51].During the evolutionary process, the algorithm evaluates the fitness of each mathematical expression by comparing its performance in fitting the dataset [50].The fittest expressions are then selected to produce offspring through genetic operators, and this process is repeated for multiple generations until a satisfactory mathematical model is obtained [49].EPR has been used in various fields, including engineering, economics, and environmental science, to model complex relationships between input variables and output data [50].It has the advantage of being able to automatically discover the structure of the mathematical model, making it particularly useful when the underlying relationship between variables is not well understood or when traditional regression techniques may not be suitable [49].However, it's worth noting that EPR, like other evolutionary algorithms, can be computationally intensive and may require careful parameter tuning to achieve optimal results.Additionally, the interpretability of the evolved models can be a challenge, as the resulting mathematical expressions may be complex and difficult to understand.

Models' evaluation indices
Machine learning model performance evaluation is a critical step in the development and deployment of machine learning models.It involves assessing how well a trained model performs on new, unseen data.There are several methods for evaluating the performance of machine learning models, and the choice of method depends on the type of problem and the nature of the data [49].Here are some commonly used techniques for evaluating machine learning model performance: Train/Test Split: The simplest method for evaluating model performance is to split the available data into a training set and a testing set [50].The model is trained on the training set and then evaluated on the testing set to assess its performance on unseen data.Cross-Validation: Cross-validation is a technique used to assess how well a model generalizes to new data [51].It involves splitting the data into multiple subsets, training the model on a combination of these subsets, and then evaluating it on the remaining subset.This process is repeated multiple times, and the results are averaged to obtain a more reliable estimate of the model's performance.Performance Metrics: Various performance metrics can be used to evaluate the performance of machine learning models, depending on the nature of the problem [49].Common metrics include accuracy, precision, recall, F1 score, area under the receiver operating characteristic (ROC) curve (AUC-ROC), and mean squared error (MSE), among others.Confusion Matrix: For classification problems, a confusion matrix can be used to visualize the performance of a model by showing the number of true positive, false positive, true negative, and false negative predictions [51].ROC Curve and Precision-Recall Curve: These curves are used to evaluate the performance of binary classification models and visualize the trade-off between true positive rate and false positive rate, or precision and recall, respectively [50].Bias-Variance Trade-off: Understanding the bias-variance trade-off is crucial for evaluating model performance.A model with high bias may underfit the data, while a model with high variance may overfit the data.Balancing bias and variance are important for creating a model that generalizes well to new data [49].Hyperparameter Tuning: Evaluating model performance often involves hyperparameter tuning, which involves adjusting the settings of a model to optimize its performance.Techniques such as grid search and random search can be used to find the best hyperparameters for a given model.It's important to note that the choice of evaluation method and performance metric depends on the specific machine learning problem at hand, and there is no one-size-fits-all approach.Additionally, it's important to consider the implications of the chosen evaluation method on the overall goals of the machine learning project [50].Meanwhile, data partitioning and k-fold cross-validation are both techniques used in machine learning for model evaluation and selection.Each approach has its own effects and considerations, and they can be complementary in addressing various challenges in model training and assessment.In data partitioning, the dataset is typically divided into two subsets: a training set and a test set.The training set is used to train the model, while the test set is used to evaluate the model's performance on unseen data [51].Commonly, data partitioning involves a 70/30 or 80/20 split, where the larger portion of the data is used for training and the smaller portion for testing.This approach is simple and easy to implement.K-fold cross-validation involves dividing the dataset into K subsets (folds) [50].The model is trained and evaluated K times, each time using a different fold as the validation set and the remaining folds as the training set.Performance metrics are averaged across the K iterations to obtain a final performance estimate.K-fold cross-validation provides a more comprehensive assessment of the model's performance, as it uses the entire dataset for both training and validation.Data partitioning can lead to variability in model performance estimates, especially when the test set is small [49].The performance of the model may heavily depend on which data points end up in the test set.K-fold cross-validation provides a more reliable estimate of model performance, as it averages performance across multiple validation sets, reducing the impact of variability in the test set [50].With data partitioning, a portion of the data is reserved solely for testing, which means that less data is available for model training K-fold cross-validation makes more efficient use of the available data, as each data point is used for both training and validation at some point during the K iterations [50].Data partitioning may lead to a risk of overfitting to the test set, especially when the test set is relatively small [49].K-fold cross-validation can help mitigate the risk of overfitting to a single test set, as the model is evaluated on multiple validation sets, providing a more robust assessment of its generalization performance [49].In practice, both techniques can be used in combination.For example, a dataset can be partitioned into a training set and a holdout test set, and K-fold cross-validation can be applied to the training set for model selection and hyperparameter tuning [49].This combined approach allows for robust model evaluation while still reserving a separate test set for final model assessment.K-fold cross-validation is a powerful technique for assessing and mitigating overfitting in machine learning models.Overfitting occurs when a model learns to perform well on the training data but does not generalize well to unseen data [51].K-fold cross-validation helps to address overfitting in the following ways: By using K-fold cross-validation, the model is evaluated multiple times on different subsets of the data.This process allows for a more comprehensive understanding of how well the model generalizes to unseen data [50].If a model performs well across all K folds, it is an indication that the model is less likely to be overfitting [49].When a model is trained and evaluated on a single train-test split, the performance estimate can be highly dependent on which data points end up in the training set and which end up in the test set.This can lead to high variance in the performance estimate.K-fold cross-validation helps to reduce this variance by averaging performance across K different validation sets, providing a more stable estimate of model performance [49].In each fold of K-fold cross-validation, every data point is used for both training and validation.This ensures that all data points contribute to the evaluation of the model, which can help in identifying overfitting tendencies that may not be apparent when using a single train-test split [50].K-fold cross-validation can be used to compare the performance of multiple models and select the one that generalizes best to unseen data.This can help in choosing a model that is less prone to overfitting [50].
When tuning hyperparameters of a model, K-fold cross-validation can be used to find the optimal settings while guarding against overfitting to the validation set.Overall, K-fold cross-validation is a valuable tool for assessing and addressing overfitting in machine learning models [50].It provides a more robust evaluation of model performance and helps in selecting models that are more likely to generalize well to new data.

Sum of Squared Errors (SSE).
The sum of squared errors (SSE), also known as the residual sum of squares (RSS), is a commonly used metric for evaluating the performance of a regression model.It is a measure of the discrepancy between the observed values and the values predicted by the model [59].In the context of linear regression, the SSE is calculated by taking the difference between each observed target value (i.e., the actual value in the dataset) and the corresponding predicted value (i.e., the value predicted by the regression model), squaring each difference, and then summing up all these squared differences [58,59].Mathematically, the SSE for a regression model with n data points can be expressed as: Where: y i represents the observed value of the target variable for the i-th data point, y ¯represents the predicted value of the target variable for the i-th data point, and the summation is taken over all n data points.The SSE is a measure of the variability of the data that is not explained by the regression model [49].A lower SSE indicates a better fit of the model to the data, as it means that the model's predictions are closer to the actual observed values.The SSE is often used in the context of ordinary least squares (OLS) regression, where the goal is to minimize the SSE to find the best-fitting line or hyperplane for the given data [50].Minimizing the SSE is equivalent to finding the parameters of the regression model that provide the best fit to the data in terms of minimizing the squared differences between the observed and predicted values [59].While the SSE is a useful metric for evaluating the performance of regression models, it is important to consider other metrics as well, such as R-squared and others, to gain a comprehensive understanding of the model's performance.
3.4.2R-squared value.R-squared, often denoted as \ (R^2 \), is a statistical measure that represents the proportion of the variance in the dependent variable that is predictable from the independent variable(s) in a regression model [59].In the context of linear regression, it is a measure of how well the independent variables explain the variability of the dependent variable.The \ (R^2 \) value is calculated as the ratio of the explained sum of squares (ESS) to the total sum of squares (TSS), and is defined as: Where: SSE denotes the sum of squared errors (also known as residual sum of squares), which measures the discrepancy between the observed values and the values predicted by the model.TSS represents the total sum of squares, which measures the total variance in the dependent variable.Alternatively, the \(R^2 \) value can also be calculated as the squared correlation coefficient between the observed and predicted values of the dependent variable [58].This interpretation underscores the notion that \(R^2 \) measures the proportion of the variance in the dependent variable that is explained by the independent variables in the model.The \(R^2 \) value ranges from 0 to 1, with: \(R^2 = 0 \) indicating that the independent variables do not explain any of the variability of the dependent variable.\(R^2 = 1 \) indicating that the independent variables explain all of the variability of the dependent variable [59].Interpretation of \(R^2 \): A higher \(R^2 \) value indicates that a larger proportion of the variance in the dependent variable is explained by the independent variables, suggesting a better fit of the model to the data.A lower \ (R^2 \) value indicates that the independent variables provide little explanatory power for the dependent variable [57].It's important to note that while \ (R^2 \) is a useful measure for assessing the goodness of fit of a regression model, it should be used in conjunction with other metrics, such as adjusted \ (R^2 \), mean squared error (MSE), and others, to gain a comprehensive understanding of the model's performance and to avoid potential pitfalls associated with overfitting or underfitting.

Response Surface Methodology (RSM)
Response Surface Methodology (RSM) the framework of which is shown in Fig 5 is a collection of mathematical and statistical techniques used for designing experiments, building empirical models, and finding the optimal conditions for a process [52].It is commonly used in scientific and engineering fields, particularly in areas such as chemical engineering, manufacturing, and product development.Modeling and optimizing a process: RSM is used to develop mathematical models that represent the relationship between input variables (factors) and the output response of interest [53].These models are then used to optimize process conditions to achieve desired outcomes.Experiment design: RSM involves the design of experiments to efficiently explore the relationships between multiple input variables and the response variable [54].This typically involves conducting a series of experiments with carefully chosen factor settings to collect data for model building.Finding optimal process conditions: Once the mathematical models are developed, RSM techniques are used to identify the optimal or near-optimal settings of the input variables that lead to the desired response [55].Factorial designs: These are experimental designs in which all possible combinations of factor levels are studied.They are used to identify the main effects and interactions of the factors on the response [56].Central composite designs: These designs involve a combination of factorial points and center points, and are used to fit a second-order polynomial model.They are effective for estimating curvature and interaction effects [57].Box-Behnken designs: These designs are used to fit a secondorder model without needing to study all possible combinations of factor levels.They are particularly useful when the number of factors is moderate.Analysis of variance (ANOVA): ANOVA is used to analyze the significance of the factors and their interactions on the response variable [54].Response surface optimization: Optimization techniques are used to find the optimal settings of the input variables that lead to the best or desired response [56].Overall, Response Surface Methodology provides a systematic and efficient approach for understanding and optimizing complex processes by using empirical models and statistical techniques.

GP model technique
The developed GP model has four levels of complexity.The population size, survivor size and number of generations were 100 000, 30 000 and 100 respectively.Eqs 3 and 4 presented the output formula for (UCS7and UCS28) respectively.The average errors % of total dataset are (0.24, 0.34), while the R 2 values are 0.777 and 0.650 in order.The proposed closed-form equation suggests that the proportion of silt and clay are more decisive in the behavior of the compressive strength of the lime reconstituted soil at the 7 days curing stage.

ANN model technique
A predictive model was developed using ANN technique to predict both UCS7 and UCS28 values.It used normalization method (-1.0 to 1.0), activation function (Hyper Tan) and "Back propagation" (BP) training algorithm.The ANN developed model weight matrix is presented in Table 5.The used network layout is illustrated in Fig 6 while the weight matrix of the model is showed in Table 3.The average errors in percentage for the total dataset were found to be 12% and 15%, with corresponding R 2 values of 0.952 and 0.947, respectively.The relative importance values for each input parameter are illustrated in Fig 7, which indicated that all factors have almost the same importance for both UCS7 and UCS28 except the Lime content (L) which has less influence.It further shows that sand (S) and clay (C) are the most impactful on the strength response of the lime-reconstituted graded soil, which corroborates with previous reports [51,65].This shows that the finer materials perform better with the strength improvement of landfill liners and subgrades under lime treatment [51].

EPR model technique
Finally, the developed EPR model was limited to 6 th level polynomial, for 5 inputs, there are 462 possible terms (252+126+56+21+6+1 = 462) as follows: GA technique was applied on these 462 terms to select the most effective 47 terms to predict the values of UCS7 and 50 terms to predict UCS28 values.The average error percentages and R 2 values for UCS7 and UCS28 were found to be 12% and 18% and 0.955 and 0.923, respectively.The relations between calculated and predicted values are shown in Figs 8 and 9 and the results of all the developed models are summarized in Table 6.In  The ANN model can only be applied within the intelligent interface because it didn't produce a closed-form equation with which a manual application is executed.

RSM model analysis
The Predicted R 2 for the UCS 7 of 0.9879 is in reasonable agreement with the Adjusted R 2 of 0.8805; i. e. the difference is less than 0.2.Adeq precision measures the signal to noise ratio.A ratio greater than 4 is desirable.Your ratio of 14.169 indicates an adequate signal.This model  can be used to navigate the design space.The Predicted R 2 for the UCS 28 of 0.9942 is in reasonable agreement with the Adjusted R 2 of 0.8780; i.e., the difference is less than 0.2.Adeq precision measures the signal to noise ratio.A ratio greater than 4 is desirable.Your ratio of 14.722 indicates an adequate signal.This model can be used to navigate the design space.These are presented in Table 7.The UCS7 equation (Eq 8) in terms of actual factors can be used to make predictions about the UCS response for given levels of each factor with high accuracy and adequate precision.Here, the levels should be specified in the original units for each factor.This UCS7 equation should not be used to determine the relative impact of each factor because the coefficients are scaled to accommodate the units of each factor and the intercept is not at the center of the design space.The UCS28 equation (Eq 9) in terms of actual factors can be used to make predictions about the lime reconstituted soil strength response for given levels of each factor with high performance accuracy.Here, the levels should be specified in the original units for each factor.This equation should not be used to determine the relative impact of each factor because the coefficients are scaled to accommodate the units of each factor and the intercept is not at the center of the design space.
The optimized model representation are shown in Figs 11-21 and these succinctly illustrate the normal plot of residuals for normal percentage probability and externally studentized residuals for the unconfined compressive strength RSM model, the illustrative representation  previous works where RSM had been applied as the decisive model [52][53][54][55].Comparatively, the perfromance level of the RSM is to be considwred seriosly in the deisgn and cinstruction of civil engineering structures like the road pavement subgrade and landfill liner due to its ability to propose a closed-form equation at over 95% accuracy and over 14% adequate precision, that could be applied manually and automatically during the design and constructted structures performance monitoring [49,50].In

Conclusions
This study presents three models utilizing Artificial Intelligence (AI) techniques, namely Genetic Programming (GP), Artificial Neural Network (ANN), and Evolutionary Polynomial Regression (EPR), and one utilizing a symbolic algorithm known as the Response Surface Methodology (RSM) to predict the unconfined compressive strength after 7 and 28 days (UCS7 and UCS28 in kPa) for lime-stabilized soil.The models incorporate input variables such as Gravel, Sand, Silt, Clay, and Lime contents (G, S, M, C, L).The following key conclusions can be drawn from comparing the accuracies of the developed models: • Both ANN and EPR demonstrated similar levels of accuracy, reaching approximately 88% for the compressive strength after 7 days (UCS7), while the GP model exhibited a lower accuracy level of 76%.
• The prediction accuracy of the compressive strength after 28 days (UCS28) was lower compared to UCS7 due to the increased complexity of the required formula.The accuracy for ANN and EPR models was approximately 85% and 82%, respectively.Conversely, the GP model exhibited the lowest performance with an accuracy of 66.0%.
• The results indicate that all factors have nearly equal importance for both UCS7 and UCS28, except for the lime content (L), which has a lesser influence.
• Similar to other regression techniques, the generated formulas are valid within the considered range of parameter values.Beyond this range, it is essential to verify the prediction accuracy.
• However, the RSM model compared well with the results of the AI-powered models in performance accuracy and further proposed a closed-form equation for manual and automatic application in the design of optimized utilization of lime in soil treated for the purpose of subgrade and landfill liner construction.This is very important in practice as quicker manual applications are needed to verify optimized material usage for the best performance.
• This research emphasizes the immense potential of ML techniques in predicting the unconfined compressive strength of lime reconstituted graded soil mixtures.It further emphasises the impactful influence of the finer soils in the sand and clay categories on the strength of the studied lime reconstituted soil.The findings contribute to a better understanding of the behavior of lime-stabilized soil, offering valuable insights for engineering applications in the field of soil mechanics.However, the results and validity of the models are within the selected database and the soil treated with lime.So, future research work is expected to extend towards studying other sustainable cementitious materials database applied in soil for the design and construction of subgrade and landfill liners.

Fig 7 .
Fig 7. Relative importance of input parameters.https://doi.org/10.1371/journal.pone.0301075.g007 Fig 10,  the accuracies of the developed models are compared by using Taylor charts for the UCS7 and UCS28.The data entries are more concentrated around 500 to 1500 kPa for the UCS7 and around 900 to 2700 kPa for the UCS28 for the training and validation entries of the three models as shown in Figs8 and 9.The best fit for the ANN with the line equation of y = 0.987x and y = 980x for the 7 days cured unconfined compressive strength (UCS7) and the 28 days cured unconfined compressive strength (UCS28), respectively show the best efficient model execution even though the ANNs models did not produce closed-form equations, which may allow for a manual application of the superior model.The Taylor diagram agrees with the performance model shown in Figs8 and 9, which shows the ANN and EPR in the 0.95-0.99segment of the accuracy diagram in Fig 10.Finally, the closed-form equations especially the EPR model are applicable in the design and construction of landfill liners and subgrade to determine the optimized compressive strength of the compacted earth layer as the foundation course for a flexible pavement at 250 kN/m 2 and compacted earth liner at 200 kN/m 2 strength surfaces to determine the allowable strength for sustainable liner courses for a lime reconstituted gravel-sand-siltclay (G-S-M-C) graded soil, however this is supported by previous research works[1,3,4,65].

Fig 11 .Fig 12 .Fig 13 .Fig 14 .Fig 15 .Fig 16 .
Fig 11.Normal plot of residuals for normal percentage probability and externally studentized residuals for the unconfined compressive strength RSM model.https://doi.org/10.1371/journal.pone.0301075.g011 Fig 21, a 3D surface and FDS graphical behavior of the UCS was presented showing the 3D behavioral effect of selected parameters.It further shows the behavioral consistency between the UCS at 28 days of the lime reconstituted soil and the G versus S, M, and C proportions in the overall mixes.It can be adduced that the UCS improved with higher values of S, M, and C against higher values of the G, which produced a reduction trend in the behavior of the UCS at 28 days.